Distributed Algorithms for Scheduling on Line and Tree Networks 



C/3 



Venkatesan T. Chakaravarthy Sambuddha Roy Yogish Sabharwal 

IBM Research Lab, New Delhi, India 
{ vechakr a , sambuddha,y sabharwal } @in . ibm . com 



Abstract 

(N 

We have a set of processors (or agents) and a set of graph networks defined over some vertex 
set. Each processor can access a subset of the graph networks. Each processor has a demand 
[ specified as a pair of vertices (u, v), along with a profit; the processor wishes to send data 

■ between u and v. Towards that goal, the processor needs to select a graph network accessible to 

it and a path connecting u and v within the selected network. The processor requires exclusive 
access to the chosen path, in order to route the data. Thus, the processors are competing for 
routes/channels. A feasible solution selects a subset of demands and schedules each selected 
demand on a graph network accessible to the processor owning the demand; the solution also 



specifies the paths to use for this purpose. The requirement is that for any two demands 
scheduled on the same graph network, their chosen paths must be edge disjoint. The goal is to 
output a solution having the maximum aggregate profit. Prior work has addressed the above 
CI problem in a distibuted setting for the special case where all the graph networks are simply 

paths (i.e, line-networks). Distributed constant factor approximation algorithms are known for 
. this case. 

Q\ ' The main contributions of this paper are twofold. First we design a distributed constant 

factor approximation algorithm for the more general case of tree- networks. The core compo- 
\f} | nent of our algorithm is a tree-decomposition technique, which may be of independent interest. 

Secondly, for the case of line-networks, we improve the known approximation guarantees by a 
factor of 5. Our algorithms can also handle the capacitated scenario, wherein the demands and 
edges have bandwidth requirements and capacities, respectively. 

1 Introduction 



Consider the following fundamental scheduling/routing problem. We have a set V consisting of n 
points or vertices. A set of r undirected graphs provide communication networks over these vertices. 
All the edges in the graphs provide a uniform bandwidth, say 1 unit. There are m processors (or 
agents) each having access to a subset of the communication networks. Each processor P has 
a demand/job a specified as a pair of vertices u and v, and a bandwidth requirement (or height) 
h(a) < 1. The processor P wishes to send data between u and v , and for this purpose, the processor 
can use any of the networks G accessible to it. To send data over a network G, the processor P 
requires a bandwidth of h(a) along some path (or route) connecting the pair of vertices u and v 
in G. The input specifies a profit for each demand. A feasible solution is to select a subset of 
demands and schedule each selected demand on some graph-network. For each selected demand 
(u, v) scheduled on a graph-network G, the feasible solution must also specify which path connecting 
u and v must be used for transmission. The following conditions must be satisfied: (i) Accessibility 
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requirement: If a demand (u, v) owned by a processor P is scheduled on a graph-network G, then 
P should be able to access G; (ii) Bandwidth requirement: For any network G and for any edge e in 
G, the sum of bandwidth requirements of selected demands that use the edge e must not exceed 1 
unit (the bandwidth offered by the edge). We call this the throughput maximization proble 770. We 
shall refer to the special case of the problem wherein the heights of all demands is 1 unit as the unit 
height case. In this case, we see that the paths of any two demands scheduled on the same network 
should be edge disjoint. The general case wherein the heights can be arbitrary will be referred to 
as the arbitrary height case. 

It is known that the throughput maximization problem is NP-hard to approximate within a 
factor of ^(log 1 / 2- ^ n), even for the unit height case of a single graph-network [Ij. Constant factor 
approximations are known for special cases of the throughput maximization problem (c.f. |10j). 
Our goal in this paper is to study the problem in a distributed setting. Prior work has addressed 
the problem in a distributed setting for the special case of line networks. In our paper, we present 
distributed algorithms for the more general case of tree networks and also improve the known ap- 
proximation ratios for the case of line networks. We first discuss the concept of line networks and 
summarize the known sequential and distributed algorithms for this case. 

Line-Networks: A line-network refers to a graph which is simply a path. Consider the special 
case of the throughput maximization problem wherein all the graph-networks are identical paths; 
say the path is 1,2, ... ,n. We can reformulate this special case by viewing the path as a timeline. 
We visualize each edge (i, i + 1) as a timeslot so that the number of timeslots is n — 1 , say numbered 
1,2, ... ,n — 1; then the timeline consisting of these timeslots becomes a range [l,n — 1]. Each 
demand pair (u, v) can be represented by the timeslots it, u + 1, . . . ,v — 1 and can be viewed as a 
interval [u, v — 1]. Thus, each demand can be assumed to be specified as an interval [s,e], where 
s and e are the starting and ending timeslots. Each graph network can be viewed as a resource 
offering a uniform bandwidth of 1 unit throughout the timeline. We see that a feasible solution 
selects a set of demands and schedules each demand on a resource accessible to the processor 
owning the demand such that for any resource and any timeslot, the sum of heights of the demands 
scheduled on the resource and active at the timeslot does not exceed 1 unit. The goal is to choose 
a subset of demands with the maximum throughput. See Figured] for an illustration. 

In natural applications, a demand may specify a window [rt, dl] (release time and deadline) 
where it can be executed and a processing time p. The job can be executed on any time segment 
of length p contained within the window. The rest of the problem description remains the same 
as above. In the new setup, apart from selecting a set of demands and determining the resources 
where they must be executed, a feasible solution must also choose a execution segment for each 
selected demand. As before, the accessibility and the bandwidth constraints must be satisfied. The 
goal is to find a feasible solution having maximum profit. 

The throughput maximization problem on line-networks has been well-studied in the realm of 
classical, sequential computation. For the arbitrary height case, Bar-Noy et al. [4] presented a 5- 
approximation algorithm. For the unit height case, Bar-Noy et al. [3], and independently Berman 
and Dasgupta |5j presented 2-approximation algorithms; both these algorithms can also handle the 

lr The generalization in which the bandwidths offered by edges can vary has also been studied. For the case where 
there is only one graph, this is known as the unsplittable flow problem (UFP), which has been well-studied (see survey 
|12|). In this paper, we shall only consider the case where the bandwidth offered by all the edges are uniform, say 1 
unit 
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Figure 1: Illustration for the problem on line- networks. The bandwidth/capacity offered by the 
resource is 1 unit throughout the timeline. The sets of demands {^4, C} and {B, C} can be scheduled 
on the resource, but both A and B cannot be scheduled on the same resource. 

notion of windows. Generalizations and special cases of the problem have also been studiec@. 

Panconesi and Sozio |15l [T6] studied the throughput maximization problem on line-networks 
in a distributed setting. In this setup, two processors can communicate with each other, if they 
have access to some common resource. We shall assume the standard synchronous, message passing 
model of computation: in a given network of processors, each processor can communicate in one 
step with all other processors it is directly connected to. The running time of the algorithm is 
given by the number of communication rounds. This model is universally used in the context of 
distributed graph algorithms. We require that the local computation at any processor takes only 
polynomial time. To be efficient, we require the communication rounds to be polylogarithmic in 
the input size. We can construct a communication graph taking the processors to be the vertices 
and drawing an edge between two processors, if they can communicate (i.e., they share a common 
resource). Notice that the diameter of the communication graph can be as large as the number 
of processors m. So, there may be a pair of processors such that the path connecting them has a 
large number of hops (or edges). Hence, within the stipulated polylogarithmic number of rounds, 
it would be infeasible to send information between such a pair of processors. The above fact makes 
it challenging to design distributed algorithms with polylogarithmic number of rounds. 

Under the above model, Panconesi and Sozio [16] designed distributed approximation algorithms 
for the throughput maximization problem on line networks. For the case of unit height demands, 
they presented an algorithm with an approximation ratio of (20 + e) (throughout the paper, e > is 
a constant fixed arbitrarily). For the general arbitrary height case, they devised an algorithm with 
an approximation ratio of (55 + e). Both the above algorithms can also handle the notion of win- 

For the case where there is only one line-network and there are no windows, improved approximations are known 
[HE]- The UFP problem on line- networks (where the bandwidth offered varies over the timeline) has also been well 
studied (see [51 [21 [51 ITU1 111] ) and a constant factor approximation algorithm is known |B]. 
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Here, L max and L m i n are the maximum and minimum length of any demand, and p max and p m \ n 
are the maximum and minimum profit of any demand. The value /i m in is the minimum height 
of any demand (recall that all demand heights are at most 1 unit); in the case of unit height de- 
mands, /i m in = 1- The value Time(MIS) is the number of rounds needed for computing a maximal 
independent set (MIS) in general graphs. The randomized algorithm of Luby [13] can compute 
MIS in 0(log N) rounds, where N = nmr (n, m and r are the number of timeslots, demands and 
resources, respectively); if this algorithm is used, then the overall distributed algorithm would also 
be randomized. Alternatively, via network-decompositions, [17] present a deterministic algorithm 



Our Contributions: In this paper, we make two important contributions. The first is that we 
provide improved approximation ratios for the throughput maximization problems on line-networks 
addressed by Panconesi and Sozio [16] . Secondly, we present distributed approximation algorithms 
for the more general case of tree-networks. A tree-network refers to a graph which is a tree. 
Notice that in a tree, the path between a pair of vertices u and v is unique and so, it suffices if the 
feasible solution schedules each selected demand on a tree-network and the paths will be determined 
uniquely (see Figure [2]) . 

Prior work has addressed the throughput maximization problem for the scenario where the 
input consists of a single tree- network (and all processors have access to the sole tree-network). 
Under this setup, Tarjan showed that the unit height case can be solved in polynomial time [18]. 
Lewin-Eytan et al. |13| presented a 5-approximation algorithm for the arbitrary height case. In 
the setting of multiple tree- networks, the problem is NP-hard even for the unit height case. By 
extending the algorithm of Lewin-Eytan et al., we can show that the problem can be approximated 
within a factor of 3 and 8, for the unit height and arbitrary height cases, respectively. 

One of the main goals of the current paper is to design distributed algorithms for the throughput 
maximization problems on tree- networks. Our main result is: 

Main result: We present a distributed (7 + e)-approximation algorithm for the unit 
height case of the throughput maximization problem on tree-networks. 

The number of communication rounds is polylogarithmic in the input size: 0(Time(MIS) • (1/e) • 
logn • log(p max /p m i n )). Here, n is the number of vertices; p max and p m \ a are the maximum and 
minimum profits. Time(MIS) is the number of rounds taken for computing MIS in arbitrary graphs 
with N vertices, where N = mr (m is the number of processors/demands and r is the number of 
input tree-networks). As in the work of Panconesi and Sozio [16], the size of each message is O(M) 
where M is the number of bits needed for encoding the information about a demand (such as its 
profit, end-points and height). 

Recall that Panconesi and Sozio [16] presented a distributed (20 + e)-approximation algorithm 
for the unit height case of the line-networks problem. The main result provides improvements over 
the above work along two dimensions: the new algorithm can handle the more general concept of 
tree-networks and simultaneously, it offers an improved approximation ratio. 

Extending the main result, we design a distributed (80 + e)-approximation algorithm for the 
arbitrary height case of the tree-networks problem The number of communication rounds taken by 
this algorithm is 0(Time(MIS) • (1/e) • (l//i m in) • logn • log(p max /p mm )). This algorithm assumes 
that the value h m [ n is known to all the processors. Alternatively, we assume that a value /i m i n is 



with Time(MIS) = 0(2^°^). 
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Figure 2: Tree-Networks: There are three demands (1, 10), (2,3) and (12, 13). In the unit height 
case, only one of the three demands can be scheduled on the given tree-network (because they all 
share the edge (4, 5)). To illustrate the arbitrary height case, suppose their heights are 0.4, 0.7 and 
0.3, respectively. Then, the first and third demand can be scheduled together. 

fixed a priori and all the demands are required to have height at least h m i n . 

Next, we provide a improved approximation ratios for the case of line- networks with windows. 
We design distributed algorithms with approximation ratios (4 + e) and (23 + e), for the unit height 
case and arbitrary height case, respectively El- The number of communication rounds taken by these 
algorithms is the same as that of Panconesi and Sozio |16j . 

Proof Techniques and Discussion: At a technical level, our paper makes two main contribu- 
tions. The algorithms of Panconesi and Sozio [16], as well as our algorithms, go via the primal-dual 
method (see [19|). The sequential algorithms of Bar-Noy et al. [4] and Lewin-Eytan et al. |13j use 
the local ratio technique, but they can also be reformulated as primal-dual algorithms. Given a de- 
mand/job, there are multiple tree-networks (or line-networks) where the demand can be scheduled 
and we call each such possibility as a demand instance. All of the above algorithms work in two 
phases: in the first phase, a subset of candidate demand instances are identified and an assignment 
to dual variables is computed. In the second phase, the candidate set is pruned and a feasible 
solution is constructed. The dual assignment is used as a lowerbound for the optimal solution, by 

3 The conference version of the paper [5] claimed approximation ratios of (20 + e) and (11 + e) for the arbitrary 
height case of tree and line networks, respectively. However, there was a minor error in analyzing the approximation 
guarantee of the algorithm. The error is fixed in the current paper with an increase in the ratios 
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appealing to the weak-duality theorem. In fact, approximation algorithms for many other packing 
problems utilize the above two-phase strategy. 

We first formulate the above two-phase method as a framework. An important feature of the 
framework is that any algorithm following the framework must produce an ordering of the demand 
instances and also for each demand instance, it must determine the edges along the path whose dual 
variables will be increased (or raised). The ordering and the chosen edges should satisfy a certain 
property called the "interference property". The number of edges chosen, denoted A, is a factor in 
determining the approximation ratio. In the case of line-networks, Panconesi and Sozio |16j classify 
the demand instances into logarithmic many groups based on their lengths and obtain an ordering 
with A = 3. In the case of tree-networks, it is more challenging to design an ordering satisfying the 
interference property. Towards that goal, we introduce the notion of "tree-decompositions". The 
efficacy of a tree-decomposition is measured by its depth and "pivot size" 9. As it turns out, the 
pivot size 6 determines the parameter A and the depth determines the number of rounds taken by 
the algorithm. Our first main technical contribution is a tree-decomposition with depth O(logn) 
and pivot size 9 = 2. Using this tree-decomposition, we show how to get an ordering with A = 6. 
Our tree-decompositions may be of independent interest. 

Another feature of the framework is that an algorithm following the framework should produce 
an assignment for the dual variables in the first phase. This assignment need not form a dual feasible 
solution, but it should be approximately feasible: the dual assignment divided by a parameter A 
(0 < A < 1) should yield a feasible solution. The approximation ratio is inversely related to 
the parameter A. The algorithm of Panconesi and Sozio [16] produces a dual assignment with 
parameter A = 1/(5 + e). Our second main technical contribution is a method for constructing a 
dual assignment with parameter A = (1 — e). Thus, we get a improved approximation ratios for the 
case of line- networks. 

2 Unit Height Case of Tree Networks: Problem Definition 

The input consists of a vertex set V containing n vertices, a set of m processors V, a set of m 
demands A and a set of r tree-networks T (each defined over the vertex-set V). A demand a G A is 
specified as a pair of vertices a = (u, v) and it is associated with a profit p(a); u and v are called the 
end-points of a. Each processor P G V owns a unique demand a G A. For each processor PgP, 
the input also provides a set Acc(P) C T that specifies the set of tree-networks accessible to P. Let 
Pmax and p m in be the maximum and minimum profits. We will assume that all the tree-networks 
are connected. Note that the tree-networks can have different sets of edges and so, they are allowed 
to define different trees. 

A feasible solution S selects a set of demands 5Ci and schedules each a G S on some tree- 
network T G T ■ The feasible solution must satisfy the following properties: (i) for any a G S, if a 
is owned by a processor P and a is scheduled on a tree-network T, then P must be able to access 
T (i.e., T G Acc(-P)); (ii) for any two selected demands a\ = (u\,v\) and ai = (1*2,^2), ^ both 
a\ and 02 are scheduled on the same tree-network T, then the path between u\ and vi, and the 
path between U2 and x>2 in the tree-network T must be edge-disjoint (meaning, the two paths must 
not share any edge). The profit of solution S is defined to be the sum of profits of the selected 
demands; this is denoted p(S). The problem is to find the maximum profit feasible solution. 

We next present a reformulation of the problem, which will be more convenient for our discus- 
sion. Consider each demand a G A and let P be the processor which owns a. For each tree-network 
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T £ Acc(P), create a copy of a with the same end-points and profit; we call this the demand 
instance of a belonging to the tree-network T. Let T> denote the set of all demand instances over 
all the demands; each demand instance d £ T> can represented by its two end-points and the tree- 
network to which it belongs. For a demand a owned by a processor P, let Inst (a) denote the set 
of all instances of a (we have |Inst(a)| = |Acc(P)|). The profit of a demand instance d £ T> is 
defined to be the same as that of the demand to which it belongs; we denote this as p(d). A feasible 
solution selects a subset of demand instances S C V such that: (i) for any two demand instances 
di,d2 £ S, if d\ and di belong to the same tree- network T, then their paths (in the tree- network 
T) do not share any edge; (ii) for any demand a £ A., at most one demand instance of a is selected. 
The profit of the solution is the sum of profits of the demand instance contained in it. The goal is 
to find a feasible solution of maximum profit. 

The communication among the processors is governed by the following rule: two processors 
Pi and P2 are allowed to communicate, if they have access to some common resource (Acc(Pi) PI 
Acc(P 2 ) + 0). 

Notation: The following notation will be useful in our discussion. Let £ denote the set of all 
edges over all the tree- networks; any edge e £ £ is represented by a triple (u,v,T), where u and v 
are vertices of e and T is the tree-network to which e belongs. For a tree-network T, let T>(T) denote 
the set of all demand instances belonging to T. Any demand instance d £ D(T) can be viewed as 
a path in T and we denote this as path(d). For a demand instance d £ V(T) and an edge e in T, 
we say that d is active on the edge e, if the path(d) includes e; this is denoted d ~ e. We say that 
two demand instances d\ and c?2 are overlapping, if d\ and c?2 belong to the same tree-network, and 
path(di) and path(c?2) share some edge; the demands are said to non-overlapping, otherwise. Two 
demand instances d\ and di are said to be conflicting, if both d\ and di belong to the same demand 
or they overlap; otherwise, the demands are said to be non-conflicting. We shall alternatively use 
the term independent to mean a pair of non-conflicting demands. A set of demand instances D is 
said to be independent set, if every pair of demand instances in D is independent. Notice that a 
feasible solution is nothing but an independent set of demand instances. 

3 LP and the Two-phase Framework 

Our algorithm uses the well-known primal-dual scheme and goes via a two-phase framework. We 
first present the primal and the dual LPs and then discuss the framework. 

3.1 LP Formulation 

The LP and its dual are presented below. For each demand instance d € D, we introduce a primal 
variable x(d). The first set of primal constraints capture the fact that a feasible solution cannot 
select two demand instances active on the same edge. Similarly, the second set of primal constraints 
capture the fact that a feasible solution can select at most one demand instance belonging to any 
demand. For each demand aei and each edge e £ £ , the dual includes a variable a{a) and /3(e), 
respectively. Similarly, for each demand instance d £ V, the dual includes a constraint; we call 
this the dual constraint of d. Let ad denote the demand to which a demand instance d belongs. 



7 



max 



5>(d)-p(d) 



x ( d ) < 1 (VeG£) 

x(d) < 1 (VaG.4) 

ciglnst(a) 

> (Vd£V) 



min a(a) + ^J/3(e) 

ae.4 ee£ 

(a d )+ £ /3(e) > (VdGD) 



e : df 



a(a) > (Va € A) 
/3(e) > (VeG£) 



3.2 Two-phase framework 

We formulate the ideas implicit in |16l HlfT5] in the form of a two-phase framework, described next. 
Our algorithm would follow this framework. 

First Phase: The procedure initializes all the dual variables a(-) and /?(•) to and constructs 
an empty stack, and then it proceeds iteratively. Consider an iteration. Let U be the set of all 
demand instances whose dual constraints are still unsatisfied. We select a suitable independent 
set I C U (how to select I is clarified below). For each d G /, we wish to increase (or raise) the 
value of the dual variables suitably so that the dual constraint of d is satisfied tightly (i.e., the 
LHS becomes equal to the RHS). For this purpose, we adopt the following strategy. Consider each 
demand instance d G /. We first determine the slackness s of the constraint, which is the difference 
between the LHS and RHS of the constraint: s = p(d) — (a(a^) + Y^ e ■ d~e @( e ))- We next select a 
suitable subset ir(d) consisting of edges on which d is active (how to select ir(d) is clarified below). 
Next we compute the quantity 5(d) = s/(\n(d)\ + 1). We then raise the value of a(ad) by the 
amount 5(d); and for each e G ir(d), we raise dual variable (3(e) by the amount 5(d). We see that 
the dual constraint is satisfied tightly in the process. The edges ir(d) are called the critical edges of 
d. We say that the demand instance d is raised by the amount 5(d). Finally, the independent set / 
is pushed on to the stack (as a single object). This completes an iteration. In the above framework, 
in each iteration, we need to select an independent set I and the critical set of edges ir(d) for each 
d G /. These are left as choices that must be made by the specific algorithm constructed via this 
framework. Similarly, the algorithm must also decide the termination condition for the first phase. 

Second Phase: We consider the independent sets in the reverse order and construct a solution 
S, as follows. We initialize a set D = and proceed iteratively. In each iteration, the independent 
set / on the top of the stack is popped. For each d G /, we add d to D, if doing so does not violate 
feasibility (namely, D U {d} is an independent set). The second phase continues until the stack 
becomes empty. Let S = D be the feasible solution produced by the second phase. This completes 
the description of the framework. 

An important aspect of the above framework is that is parallelizable. The set I chosen in each 
iteration of the first phase is an independent set. Hence, for any two demand instances d\,d2 G /, 
the LHS of the constraints of d\ and d<i do not share any dual variable. Consequently, all the 
demand instances d G / can be raised simultaneously. 

As we shall see, we can derive an approximation ratio for any algorithm built on the above 
framework, provided it satisfies the following condition, which we call the interference 'property: for 
any pair of overlapping demand instances d\ and c?2 raised in the first phase, if d\ is raised before 
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d,2, then path(c?2) must include at least one of the critical edges contained in ir(di). 

The following notation is useful in determining the approximation ratio. Let £ G [0, 1] be any 
real number. At any stage of the algorithm, we say that a demand instance d G T> is ^-satisfied, if 
in the dual constraint of d, the LHS is at least £ times the RHS: a(ad) + Yl e ■ d~e/^( e ) > £ ■ p(d). 
If the above condition is not true, then we say that d is ^-unsatisfied. 

We shall measure the efficacy of an algorithm following the above framework using three param- 
eters. (1) Critical set size A: Let A be the maximum cardinality of 7r(cZ), over all demand instances 
d raised by the algorithm. (2) Slackness parameter A: Let A G [0, 1] be the largest number such that 
at the end of the first phase, all the demand instances d G T> are A-satisfied. (3) Round complexity: 
The number of iterations taken by the first phase. The parameters A and A will determine the 
approximation ratio of the algorithm; we would like to have A to be small and A to be close to 1. 
The round complexity determines the number of rounds taken by the algorithm when implemented 
in a distributed setting. We say that the algorithm is governed by the parameters A and A. 

The following lemma provides an approximation guarantee for any algorithm satisfying the 
interference property. The lemma is similar to Lemma 1 in the work of Panconesi and Sozio [15J. 
Let Opt denote the optimal solution to the input problem instance. 

Lemma 3.1 Consider any algorithm satisfying the interference property and governed by parame- 
ters A and A. Then the feasible solution S produced by the algorithm satisfies p(S) > aTi) '-P(Opt)- 

Proof: At the end of the first phase, the algorithm produces dual variable assignments a(-) and 
/?(•). Even though this assignment may not form a dual feasible solution, it ensures that all the 
demand instances are A-satisfied; (intuitively, all the dual constraints are approximately satisfied). 
It is easy to convert the assignment (a, j3) into a dual feasible solution by scaling the values by an 
amount 1/A: for each demand instance d, set a(d) = a(d)/\ and for each edge e, set /3(e) = /3(e)/A. 
Notice that the (2, f3) forms a feasible dual solution. 

Let val(a, j3) and val(a?, f3) be the objective value of the dual assignment {a, (3) and the dual 
feasible solution (a, (3), respectively. By the weak duality theorem, val(S,/3) > p(0pt). The scaling 
process implies that val(S, (3) = val(a,/3)/A. We now establish a relationship between val(a, /3) and 
p(S). 

Let R denote the set of all demand instances that are raised in the first phase. The value 
val(a, (3) can be computed as follows. For any d G R, at most A + 1 dual variables are raised by an 
amount 5(d) (because |7r(<i)| < A). So, whenever a demand instance d G R is raised, the objective 
value raises at most (A + I) 5(d). Therefore, 

val(a,/3) = ^a(a) + ^/3(e) 

< (A + 1)£W (1) 

deR 

We next compute the profit of the solution p(S). For a pair of demand instances d\, cfe £ R, we 
say that d\ is a predecessor of d2, if the pair is conflicting and d\ is raised before efo; m this case di 
is said to be a successor of d\. For a demand instance d € R, let pred(d) and succ(d) denote the 
set of predecessors and successors of d, respectively; we include d in both the sets. 

Consider an element d G S. We claim that 

P (d)> y, *W- w 

d'epred(d) 
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To see this claim, consider the iteration in which d is raised. At the beginning of this iteration the 
constraint of d is unsatisfied and d is raised to make the constraint tightly satisfied. The interference 
property ensures the following condition: any demand instance d! € pred(d) with d' 7^ d would have 
contributed a value of at least 6(d') to the LHS of the constraint (because the property enforces 
that the LHS includes at least one of raising dual variables of d'). Thus, at the beginning of the 
iteration, the value of the LHS satisfies: 

p(d) > LHS > E W- 

d! : c('eprcd(d) and d'^d 

When d is raised, LHS increases at least by 5(d) and the constraint becomes tight. This proves the 
claim. 

We can now compute a lowerbound on the profit of S: 

p(s) = j2 P (d) 

des 

E E w 

deS d'epred(d) 

E E m 

d'ER d&ucc(d')nS 

E 8(<t) ■ |succ(cOnSl. 

d'GR 

E ( 3 ) 

d'eR 



> 



> 



The second statement follows from ([2]) and the last statement follows from the fact that for any 
d' € R, either d! belongs to S or a successor of d! belongs to S (this is by the construction of the 
second phase). 

Comparing ([I]) and ([3]), we see that val(a, 0) < (A + l)p(S). The lemma follows from the 
observations made at the beginning of the proof. □ 

A local-ratio based sequential 3-approximation algorithm for the unit height case of tree- 
networks is implicit in the work of Lewin-Eytan |13| . This algorithm can be reformulated in 
the two-phase framework with parameters critical set size A = 2 and slackness A = 1 (however, 
the round complexity can be as high as n). We present the above algorithm in Appendix [A] the 
purpose is to provide a concrete exposition of the two-phase framework. 

Panconesi and Sozio |16j designed a distributed algorithm for the throughput maximization 
problem restricted to line-networks. In terms of the two-phase framework, their algorithm satisfies 
the interference property with critical set size A = 3 and slackness A = 1/(5 + e). To this end, 
they partition the demand instances in to logarithmic number of groups based on their lengths, 
wherein the lengths of any pair of demand instances found within the same group differ at most by 
a factor of 2. Then they exploit the property that if d\ and ^2 are overlapping demand instances 
found within the same group , then c?2 is active either at the left end-point, the right end-point 
or the mid-point of d\. This way, they satisfy the interference property with A = 3. We do not 
know how to extend such a length-based ordering to our setting of tree-networks. Consequently, 
designing an ordering satisfying the interference property with a constant A turns out to be more 
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challenging. Nevertheless, we show an ordering for which A = 6. Furthermore, we shall present a 
method for improving the slackness parameter A to (1 — e). The notion of tree-decompositions and 
layered decompositions form the core components of our algorithms. 

4 Tree-Decompositions and Layered Decompositions 

We first define the notion of tree-decompositions and show how to construct tree decompositions 
with good parameters. Then, we show how to transform tree decompositions into layered decom- 
positions. 

Let H be a rooted tree defined over the vertex-set V with g as the root. For a node x, define 
its depth to be the number of nodes along the path from g to x; the root g itself is defined to have 
to depth 1. With respect to H, a node y is said to be an ancestor of x, if y appears along the path 
from g to x; in this case, x is said to be a descendent of y. By convention, we do not consider x to 
be an ancestor or descendent of itself. For a node z in H, let C(z) be the set consisting of z and 
its descendents in H. 

4.1 Tree-decomposition: Definition 

Let T E T be a tree-network defined over the input vertex-set V consisting of n vertices. A subset 
of nodes C C V is called a component, if C induces a (connected) subtree in T. We say that a 
node x G V — C is a neighbor of C, if x is adjacent to some node in C. Let T[C] denote the set 
of neighbors (or neighborhood) of C. Notice that for any two nodes x £ C and y C, the path 
between x and y must pass through some node in the neighborhood T[C]. 

Let T be a tree- network and H be a rooted-tree defined over V with </ as the root. We say 
that H is a iree decomposition for T, if the following conditions are satisfied: (i) for any demand 
instance d € T>(T), if d passes through nodes x and y then d also passes through LCA(x,y), which 
is the least common ancestor of x and y in i?; (ii) for any node z in C(z) forms a component 
in T. 

For a node z E H, let denote the set of neighbors of the component C(z), i.e., r[C(z)]. 
We call x( z ) the pivot set of z. Clearly, for any nodes x G C(z) and y ^ C{z), the path between 
a; and y in T must pass through one of the nodes in x( z )- We shall measure the efficacy of a tree 
decomposition H using two parameters: (i) pivot size 0: this is the maximum cardinality of x( z ) 
over all z EV; (ii) the depth of the tree. 

See Figure [3] for an illustration. This figure shows an example tree-decomposition for the tree- 
network shown in Figure [6l The demand instance (4, 13) passes through nodes 2 and 8; it also 
passes through LCA(2,8) = 5. For the node 2, the component C(2) = {2,4}; its pivot set is 
X(2) = {1, 5}. On the other hand, C(5) = {5, 9, 8, 2, 12, 13, 4} and its pivot set is x(5) = {1}. This 
tree-decomposition has depth 4 and pivot set size 9 = 2. 

We note that it is not difficult to design tree-decompositions with parameters (depth = n, 8 = 1) 
or (depth = log n, 9 = log n). As it turns out the depth of the tree-decomposition will determine the 
number of rounds, whereas the pivot size 9 will determine the approximation ratio. Thus, neither 
of these two tree-decompositions would yield an algorithm that runs in polylogarithmic number of 
rounds, while achieving a constant factor approximation ratio. Our main contribution is a tree- 
decomposition with parameters (depth = 21ogn, 9 = 2) (we call this the ideal tree- decomposition) . 
Interestingly, the ideal tree-decomposition builds on the two simpler tree-decompositions mentioned 
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Figure 3: Tree-decompositions: Illustration. 

above. For the sake of completeness, the two simpler tree-decompositions are discussed next. 
4.2 Two Simple Tree-decompositions 

Here, we present two tree decompositions called root-fixing tree decomposition and balancing tree 
decomposition. The first decomposition has pivot size 6 = 1, but its depth can be as high as n. 
The second decomposition has depth logn, but its pivot size 6 can be as high as logn. 

Root-fixing Decomposition: Let T £ T be any input tree-network. Convert T into a rooted- 
tree by arbitrarily picking a node g € V as the root; let the resulting rooted-tree be H. It is easy 
to see that H is a tree decomposition for T. Consider any node z and let z 1 be its parent in H; let 
C(z) be the descendants of z including z itself. Notice that for any x € C(z) and y g" C(z), the 
path between x and z must pass through the parent z' . Thus the component C(z) has only one 
neighbor. We see that H has pivot size 6 = 1; however, the depth of H can be as high as n. Figure 
[6] shows a root-fixing decomposition; the chosen root is node 1. The sequential algorithm given in 
Section [A] implicitly uses the root-fixing tree decomposition. 

Balancing tree decomposition: Let T € T be a tree-network. Consider a component C C V 
and let T(C) be the (connected) subtree induced by C. Let z be a node in C. If we delete the 
node z from T(C), the tree T(C) splits into subtrees T\,T<2, . . . , T s (for some s). Let Ci, Cg, . . . , C s 
be the vertex-set of these subtrees. Every node in C — {z} is found in some component Cj. We 
say that the node z splits C into components C\, C2, ■ ■ ■ C s . The node z is said to be a balancer for 
C, if for all 1 < i < s, |C, < |JC|/2j. The following observation is easy to prove: any component 
Ccy contains a balancer z. 

Our procedure for constructing the tree decomposition for T works recursively by calling a pro- 
cedure BuildBalTD (build balanced tree decomposition) . The procedure takes as input a component 
Ccy and outputs a rooted-tree having C as the vertex-set. It works as follows. Given a compo- 
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nent C, find a balancer z for C. Then split C by z and obtain components Ci, C2, . . . ,C S (for some 
s). Each component Cj has size at most [C/2J. For 1 < i < s, call the procedure BuildBalTD 
recursively on the component Ci and obtain a tree Hi with as the root. Construct a tree H by 
making z as the root and gi, . . . , g s as its children. Return the tree H. 

Given a tree-network T, we obtain a rooted-tree H by calling BuildBalTD with the whole 
vertex-set V as the input. It is easy to see that for any node z, C(z) forms a component in T. For 
any node z in H with children z\, Z2, ■ ■ ■ , z s (for some s), C(zi), C(z2 ),•••, C(z s ) are nothing but 
the components obtained by splitting C(z) by z. This implies that H satisfies the first property 
of tree decompositions. Since the size of the input component drops by a factor of two in each 
iteration, the depth of H is at most [logn]. Consider any node z in H and let C(z) be the set 
consisting of descendants of z and z itself. Observe that for any node x G C(z) and y C(z), 
the path between x and y must pass through one of the ancestors of z in H (because of the first 
property of tree decompositions). In other words, the neighborhood of C{z) is contained within 
the set of ancestors of z. The number of ancestors is at most [logn] and hence, the pivot size of 
H is at most O(logn). Figure [3] shows an example balancing tree-decomposition for the tree given 
in Figure [U 

4.3 Ideal Tree-decomposition 

In this section, we present the ideal tree-decomposition with parameters (depth = 2 logn, 9 = 2). 
The ideal tree-decomposition also goes via the notion of balancers. Recall that any component 
Ccy contains a balancer z. 

Fix a tree-network T and we shall construct an ideal tree decomposition H for T with pivot set 
size 9 = 2 and depth O(logn). Intuitively, the tree H will be constructed recursively. In each level 
of the recursion, we will add two nodes to the tree: a balancer and a node that we call a junction. 
The output tree-decomposition will have depth at most 2 [log n] . 

The construction works via a recursive procedure BuildldealTD (build ideal tree decomposition). 
The procedure BuildldealTD takes as input a set C C V forming a component in T. As a 
precondition, it requires the component C to satisfy the important property that C has at most 
two neighbors in T. It outputs a rooted-tree H with C as the vertex set having depth at most 
2[log |C|] such that for any node x € C, the number of neighbors of C(x) is at most 2, where C(x) 
is the set consisting of x and its descendants in H. 

The procedure BuildldealTD works as follows. We first find a balancer z for the component C. 
The node z splits C into components Ci, C2, . . . , C s . We shall consider two cases based on whether 
C has a single neighbor or two neighbors. 

Case 1: This is the easier case where C has only one neighbor, say u±. See Figure HI For this 
case, ignore the nodes U2 and u' 2 . Let u' x be the node in C which is adjacent to u\ and without loss 
of generality, assume that C\ is the component to which belongs. Observe that T(C\) = {ui, z} 
and for all i > 2, r(Cj) = {z}. In other words, all the components Ci have at most two neighbors. 
That is, they all satisfy the precondition set by the procedure. For each 1 < i < s, we recursively 
call the procedure BuildldealTD on the component Ci and obtain a tree Hi with gi as the root. We 
construct a tree H by making z as the root and gi,g2, ■ ■ ■ ,gs as its children. Then, the rooted-tree 
H is returned. 

Case 2: Now consider the case where C has two neighbors, say u\ and U2- Let u\ and u' 2 be 
the nodes in C which are neighbors of u\ and U2, respectively. We consider two subcases. 
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Figure 4: Illustration for ideal tree-decomposition. 

Case 2(a): The first subcase is when u' x and u' 2 lie in two different components, say C\ and 
C2, respectively. See Figured! Observe that r(Ci) = {ui,z}, r(C*2) = {U2, z} and for all i > 3, 
r(Cj) = {z}. Hence all the components C% satisfy the precondition set by the procedure. For 
each 1 < i < s, we call the procedure BuildldealTD with Cj as input and obtain a tree Hi. We 
construct a tree H by making the balancer z as the root and gi,g2, ■ ■ ■ ,9s as its children. Then, 
the rooted-tree H is returned. 

Case 2(b): Now consider the second and comparatively more involved subcase wherein u\ and 
u' 2 belong to the same component, say C\. See Figure [SJ Observe that there exists a unique node 
j G C\ such that all the three paths u\ ~^ U2, u\ z, and 112 z pass through j. We call j 
as the junction. Spilt the component Ci by the node j to obtain components C[,C 2 , . . . , C' s , (for 
some s'). Observe that among C[,C 2 ,. . . ,C S ,, there exists three distinct components such that z 
is a neighbor of the first component, and u'-y and u' 2 belong to the other two components; without 
loss of generality, let these components be C[, C 2 and C 3 , respectively. We see that for 2 < i < s, 
r(Ci) = {z}; moreover, r(C() = {j,z}, T(C 2 ) = { Ul ,j}, T(C 3 ) = {u 2 ,j} and for 4 < i < s', 
r(Cj') = {j}. Thus, all the components C2, C3, ... ,C S and C[, C 2 , . . . , C' s , satisfy the precondition 
set by the procedure. For each 2 < i < s, we call the procedure BuildldealTD recursively with 
Ci as input and obtain a tree Hi with gi as the root. For each 1 < i < s' , we call the procedure 
BuildldealTD recursively with C[ as input and obtain a tree H^ with g[ as the root. Construct a 
tree H as follows. Make the junction j as the root; make g 2 ,g' 3 , . . . ,g' s , as the children of j; make 
z as a child of j; make g[ and g2,g3, ■ ■ ■ ,gs as the children of z. Return the rooted-tree H. This 
completes the description of the procedure BuildldealTD. 

By induction, we can argue that BuildldealTD satisfies the intended property: for any node 
x € C, the number of neighbors of C(x) is at most 2. As an example, consider the subcase in which 
u\ and u' 2 belong to the same component (the case where a junction j is created). The procedure 
creates only two nodes j and z on its own and the rest of the nodes in H are created by the recursive 
calls. Consider the node j. It is guaranteed that the input component C has at most two neighbors 
(this is the precondition set by the procedure). Since C(j) = C, we see that j satisfies the property. 
Now, consider the node z. The component C(z) is the union of 6*2,6*3, ...,C S and C[. We have 
that r[C(z)] = {j}. Thus, z also satisfies the property. The rest of the nodes satisfy the property 
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Figure 5: Illustration for Case 2(b) of ideal tree-decomposition 

by induction. 

Let us now analyze the depth of the tree H output by the procedure. Since z is a balancer 
for C, the components C\, C2, . . . ,C S have size at most [C/2\. Moreover, since C[,C 2 , ... ,C' S , are 
subsets of Ci, these components also have size at most [C/2J . Thus, all the components input to 
the recursive calls have size at most [C/2\. Thus, by induction, H has depth at most 2|~logC~|. 

We next show how to construct a tree decomposition H for the tree- network T. First, find 
a balancer g for the entire vertex-set V and split V into components C±, C2, ■ ■ ■ , C s . For each 
component Cj, r[Cj] = {g}. For each 1 < i < s, call the procedure BuildldealTD with Cj as input 
and obtain a tree Hi with gi as the root. Construct a tree H by making g as the root and each g^ 
as its children. Return H. 

We can argue that for any node z in H, C(z) forms a component in T. Furthermore, for any 
node z in H with children z\, Z2, ■ ■ ■ , z s (for some s), C(zi), C(z2), ■ ■ ■ ,C(z s ) are nothing but the 
components obtained by splitting C(z) by z. This implies that H satisfies the first property of 
tree decompositions. It follows that H is indeed a tree decomposition. The depth of H is at most 
2[~logn]. The properties of the BuildldealTD procedure ensure that the pivot size of H is at most 
2. We have the following result 

Lemma 4.1 For any tree-network T £ T , there exists a tree decomposition H (called the ideal tree 
decomposition) with depth O(logn) and pivot size 6 = 2. 
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4.4 Layered Decompositions 

In this section, we define the notion of layered decompositions and show how to transform tree 
decompositions into layered decompositions. 

Let T G T be a tree-network. A layered decomposition of T is a pair a and n, where a is a 
partitioning of T>(T) into a sequence of groups Gi, G2, ■ ■ ■ , G% and tt maps each demand instance 
d G T>(T) to a subset of edges in path(cf). The following property should be satisfied: for any 
1 < i < j < £ arid for any pair of demand instances di G C; and d<i G Gj, if cii and d<i are 
overlapping, then path^) should include at least one of the edges in n(di). The edges in ir(d) are 
called the critical edges of d. The value I is called the length (or depth) of the decomposition. 

Notice that similarity between the inference property and the notion of layered decompositions. 
We shall measure the efficacy of a layered decomposition by two parameters: (i) Critical set size 
A - this is the maximum cardinality of n(d) over all demand instances d G T>(T); (ii) the length t 
of the sequence. Our goal is to construct a layered decomposition with length O(logn) and critical 
set size A = 6. Towards that goal we shall show how to transform tree-decompositions into layered 
decompositions. The following notations are useful for this purpose. 

Let T G T be tree-network and H be a tree-decomposition for T with pivot size 9 and depth I. 
For a demand instance d, let 11(d) be the node with the least depth in H among all the nodes that 
path(ci) passes through. The first property of tree decompositions ensure that fi(d) is unique. We 
say that d is captured at fj,(d). See Figure El In this figure, the demand (4, 13) is captured at node 
5. Let d G V(T) be a demand instance and u be a node in T. Observe that there exists a unique 
node y belonging to path(d) such that the path from u to y does not pass through any other node 
in path(d). We call y as the bending point of d with respect to u. For a node y in path((i), we call 
the edges on path(d) adjacent to y as the wings of y on path(d). If y is an end-point of d, there 
will be only one wing; otherwise, there will be two wings. See Figure [6l In this figure, with respect 
to nodes 3 and 9, the bending points of the demand d = (4, 13) are 2 and 5, respectively. With 
respect to path(d), node 4 has only one wing (4, 2), while node 8 has two wings (5, 8) and (8, 13). 

Lemma 14.21 shows how to transform a tree-decomposition into a layered decomposition. The 
lemma is proved by categorizing the demand instances into groups G%, G2, ■ ■ ■ , Gg, where Gi consists 
of all demand instances captured at a node with depth £ — i + 1. For a demand instance d G T>(T), 
let z = fi(d). The set ir(d) is constructed as follows: (i) we include the wings of z on path(d); (ii) 
for each neighbor u of C(z), taking y to be the bending point of path(cf) with respect to u, we 
include the wings of y on path(d). 

Lemma 4.2 Let T G T be a tree-network and H be a tree decomposition for T with pivot size 8 
and depth t. Then H can be transformed into a layered decomposition (a, n) with critical set size 
A = 2(9 + 1) and length i. 

Proof: For 1 < i < £, let Gi to be the set consisting of all demand instances d such that depth of 
n(d) is i. We define a to be the reverse of G±, G2, ■ ■ ■ , Gf, namely, let a = G±, G2, . . . , Ge, where 
Gi = G^_j + i, for 1 < i < t. Thus, in a, the demand instances captured at the nodes having the 
highest depth are placed in G\ and the demand instances captured at the root are placed in Gi. We 
now show how to construct the critical set n(d) for each demand instance d G T>(T). Let z = fi(d) 
be the node in H where d is captured. Add the wing(s) of z on path(<i) to ir(d). Then, consider 
the component C(z) consisting of z and its descendents in H. Let U = {u\,U2, ■ ■ ■ ,u s } be the 
neighbors of C(z), where s < 9. For 1 < i < s, let yi be the bending point of d with respect to 
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Figure 6: An example tree-network. 

Uf, add the wing(s) of y, L on path(d) to Tt(d). Notice that Tr(d) has at most 2(9 + 1) edges. This 
completes the construction of a and 7r(-). 

We now argue that the construction satisfies the properties of layered decompositions. Consider 
any two groups G{ and Gj such that i < j. Consider two overlapping demand instances d\ € 
and d2 € Gj. Let z\ = n(d\) and Z2 = ^(d^) be the nodes in H where d\ and are captured, 
respectively. We consider two cases: (1) Z2 £ C(z\); (2) Z2 C(z{). 

Case 1: In this case, Z2 must be the same as z\ (otherwise, we have depth^) > depth(^i); 
this would contradict i < j). Therefore, path^) should include at least one of the wings of Z\ on 
path(di). Recall that the wing(s) of Z\ on path(di) are included in 7r(di). 

Case 2: By the LCA property of tree-decompositions, path(di) will be constained within 
the component C(z\). We have that path(c?2) goes through the node Z2 found outside of C(zi); 
moreover, it also goes through some node found within C(zi) (since d\ and overlap). By the 
second property of tree decompositions, such a path must also pass through one of the neighbors 
of C(zi); let u be such a neighbor. Let the bending point of path(di) with respect to u be y. Since 
path(c?2) passes through u and overlaps with path(di), the path^) must also pass through the 
bending point y. It follows that path^) must include one of the wings of y on path(di). Recall 
that the wing(s) of y on path(di) are included in Tv(d\). □ 

By applying Lemma 14.21 for the ideal tree decomposition (given by Lemma l4.ip . we establish 
the following result. 

Lemma 4.3 For any tree-network T € T, we can construct a layered decomposition with critical 
set size A = 6 and length at most O(logn). 
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Begin 

/ / Initialize 

For all a G A, set a(a) = 0; for all e G £, set /3(e) = 0. 
Initialize an empty stack. 

Let the input set of tree-networks be T = {21, T2, . . . , T r }. 
For each tree-network T q 

Invoke Lemma 14.31 on T q and 

obtain a layered decomposition a q = G^\ G% , . . . , and a mapping 7T g . 



Let f m aj 

For each d G V(T q ), define 7r(<i) = ir q (d). 
For each fe = 1 to 4nax, 
Define G fe = Uj =1 G^. 

/ / First phase 

For k = 1 to £ max / /Epochs 

Let b be the smallest integer such that (14/15) b < e 
For j = 1 to b I /Stages 

While / / Steps or iterations. 

Let U = {d g G k : d is (1 - (14/15)- ? ')-unsatisfied} 
If U = 0, exit the loop. 

Find a maximal independent set I contained within U 
For each d G I 

Compute slackness: s = p(cQ — a(arf) — X] e -(2~e /^( e )- 

Compute: 8(d) = s/(\ir(d)\ + 1). 

Raise the variables: a(a,d) <— a(ad) + S(d); for all e G 7t(gT), /3(e) /3(e) + 5(d). 
Push I into the stack (as a single object). 

/ / Second Phase 
5 = 0. 

While(stack not empty) 

Pop the top element I of the stack 
For each d G I 

If S U {d} is an independent set, then add d to S. 
Output S. 
End 



Figure 7: Pseudocode of the overall algorithm 

5 Distributed Algorithm 

In this section, we prove the main result of the paper by exhibiting a two-phase procedure with 
critical set size A = 6 and slackness parameter A = (1 — e), for any constant e > 0. 

Let the input tree networks be T = {T\,T2, ... ,T r }. For each tree- network T q , invoke Lemma 
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14,31 and obtain a layered decomposition a q = , G% , • • • , GfJ of length t q and a mapping 7r g . 
Let 

^max — maXq£q. The lemma guarantees that £ max is O(logn) and all the critical set sizes are 
at most A = 6. Let A' = A + 1 and £ = (2A')/(2A' + 1) = 14/15. For the ease of exposition, we 
combine all the mapping functions into single mapping function tt, as follows. For each tree-network 
T q and demand instance d G V(T q ), define vr(cZ) = ir q (d). 

For each 1 < k < i ma x, let Gk be union of the fcth components of all the layered decompositions: 
Gk = ^ r q= \G k q \ The algorithm would follow the two-phase framework. All the dual variables are 
initialized to zero and an empty stack is created. The first phase is split into ^ max epochs. Epoch 
A; will process the group Gk- Our goal is to ensure that at the end of the epoch, all the demand 
instances in Gk are (1 — e)-satisfied. Each epoch is divided into multiple stages, with each stage 
making a gradual progress towards the goal. We will ensure that at the end of stage j, all the demand 
instances in Gk are (1 — ^-satisfied. Each stage is split into multiple steps (each step corresponds 
to an iteration of the two-phase framework). A typical step is explained next. Let U be the set of all 
demand instances in Gk that are (1 — £ J )-unsatisfied. Find a maximal independent set I contained 
within U. For all demand instances d € /, raise the demand instance d as prescribed by the 
framework, taking ir(d) to be the critical edges. Namely, for all demand instances d £ I, perform the 
raising as follows. Compute the slackness s = p(d) — a(ad) — X] e ■ d~e/^( e ) an d ${d) = s/(\ir(d)\ + l). 
Raise the dual variable a(a^) by the amount 5(d) and for all e € Tr(d), raise the dual variable /3(e) 
by 5(d). The stage is completed when all the demand instances in Gk are (1 — £ J )-satisfied and 
we proceed to the next stage. The epoch is completed when all the demand instances in Gk are 
(1 — e)-satisfied. The second phase is the same as that of the two-phase framework. The pseudocode 
is provided in Figure [7J 

Let us analyze the number of steps (or iterations) taken by the above algorithm. The number of 
epochs is ^ max , which is O(logn). Each epoch has at most log^ e = 0(log(l/e)) stages. The lemma 
below provides a bound on the number of steps taken by each stage. It follows that the total 
number of communication rounds is at most 0(Time(MIS) lognlog(l/e) log(p max /p m i n )), where 
Time(MIS) is the number of (communication) rounds needed to find a maximal independent set 
(see Introduction). 

Lemma 5.1 Consider any epoch k and stage j within the epoch. The number of steps taken by the 
stage is at most 0(log(p max /p min )). 

Proof: Let the number of steps taken by the stage be L. For 1 < i < L, let Ui be the demand 
instances in Gk that are (1 — ^^-unsatisfied at the beginning of step i. Let Jj, J2, ■ ■ ■ ,Il be the 
sequence of maximal independent sets computed in these steps. For two demand instances di,d2 £ 
Gk, we say that d\ kills d2 in step i, if d\ G Ij, d% 6 J7i+i, and d\ and c?2 are conflicting. Intuitively, 
both d\ and cfe are present in Ui, and both are contenders for the maximal independent Ii. Of the 
two, d\ got selected in Ii and d^ was omitted; even after the demand instances in Ii were raised, 
c^2 was still (1 — £ J )-unsatisfied. Since d\ and d2 are conflicting, only one of them can be included 
in the independent set. We imagine that d\ "kills" ^2- 

Claim 5.2 Suppose d\ kills di in step i. Then, their profits satisfy p(d2) > 2p(d\) 

We now prove the claim. Since d\ £ Ii, the demand instance is (1 — ^-unsatisfied at the 
beginning of step i. Hence, the difference between the LHS and RHS of the constraint is at least 
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£ J • p(di). The number dual variables raised for d\ is at most A + 1. Hence, 



5( dl )>^- p{dl 



(A + l) 

Since d\ and di are conflicting, either it is the case that d\ and d2 belong to the same demand a 
or they belong to the same tree-network T q (for some q) and overlap. In the former case, the dual 
constraints of d\ and d 2 share the dual variable a(a^). In the latter case, both d\ and d 2 belong to 
the same group G k q \ Hence, the properties of layered decompositions imply that one of the critical 
edges in 7r(di) also appears in the path(o?2). Thus, in either case, when d\ is raised, the LHS of 
d 2 is also raised by an amount b[d\). On the other hand, d 2 € U^i and so, even after the above 
raise in the LHS value, d 2 is still (1 — £ J )-unsatisned. As we are considering stage j, all the demand 
instances in Gk are (1 — (£) J_1 )-satisfied. The gap between (1 — ^^ l )p{d 2 ) and (1 — &)p(d 2 ) is 
(£ J_1 — &)p(d 2 ). We see that even after the value of the LHS of the dual constraint of d 2 is raised 
by an amount 5(d%), the above gap is not bridged. It follows that 

^- 1 -e) P (d 2 )>6(d 1 )> t ''' {<], ] 



(A + l) 
This implies that 

p(d 2 ) > e 



p(dx) - (1-£)(A + 1) 

We derive the claim by substituting £ = 14/15 and A = 6. 

Consider any demand instance di € Ul- There must exist a demand instance dL-i in Ul-i 
such that dz-i kills d^. In general, we can find a sequence of demand instances dL,dL-i, ■ ■ ■ ,d\ 
such that for 1 < i < L — 1, d{ kills di + \. By the above claim, for 1 < i < L — 1, > 2p(d{). 

It follows that p(d L ) > 2 L - 1 p(d 1 ). Hence, L < 1 + \og(jp{d L ) / p{ dl )) = 0(log(p max /p min )). □ 

The properties of layered decomposition imply that the above two-phase algorithm satisfies the 
interference property, governed by parameters A = 6 and A = (1 — e). Therefore, by Lemma l3.lt 
it follows that the algorithm has an approximation ratio of 7/(1 — e). For e' > 0, we can choose 
e suitably and obtain an approximation ratio of (7 + e'). We have proved the main result of the 
paper. 

Theorem 5.3 There exists a distributed algorithm for the unit height case of the throughput maxi- 
mization problem on tree-networks with approximation ratio (7+e) and number of (communication) 
rounds is at most 0(Time(MIS) lognlog(l/e) log(p max /p m i n )) ; where e > is any constant. 

Remark: Recall that Panconesi and Sozio [15] presented an algorithm for the unit height case 
of line- networks. Their algorithm follows the two-phase framework with the slackness parameter 
A = l/(5 + e). On the other hand, our algorithm has A = (1 — e). A comparison of the two algorithms 
is in order. We reformulate their algorithm to suit our framework. Their algorithm also classifies 
the demand instances into groups (based on length) and processes the groups in epochs. However, 
each epoch consists of only a single stage. They split the stage into multiple iterations/steps. In any 
iteration, a demand instance d which is (1/(5 + e))-satisfied is ignored for the rest of the first phase. 
In contrast, our algorithm works in multiple stages, where in each stage, we make gradual progress 
towards making the demand instances within the group to be (1 — e)-satisfied. In particular, in 
stage j, a demand instance which is (1 — f^-satisfied is not ignored; it exits the current stage, but 
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it is included in the MIS computations in the next stage. 

Distributed Implementation: Here, we sketch certain aspects of implementing the algorithm 
in a distributed manner. Let M max be the number of bits needed to encode the information about 
any demand (such as its end-points and profit). 

For now, assume that the values p max and p m i n are known to all the processors. Under this as- 
sumption, we can count the number of epochs, stages and iterations exactly. The number of epochs 
is 2[logn] (the maximum depth of ideal tree decompositions); the number of stages within each 
epoch is [logg e], where £ = 14/15; the number of iterations within each stage is clog(p max /p m in)) 
where c is a suitable constant. 

Each processor P computes the ideal tree-decomposition and the corresponding layered decom- 
position for each tree-network T accessible to it. Each processor maintains the values of the dual 
variables correspoding to its demand instances. The algorithm proceeds in a synchronous fashion 
consisting of multiple communication rounds, where each round corresponds to a tuple (/i, fa, fa), 
where fa, fa and fa are the epoch, stage and iteration number of the pseudocode. Given a tuple 
{fa, fa, fa) , a processor P can determine which demand instances can participate in the MIS cal- 
culation of this communication round. The MIS calculation is performed considering the conflict 
graph: the demand instances participating in the MIS computation form the vertices and an edge 
is drawn between a pair of vertices, if they are conflicting. The number of vertices is at most 
N = mr, where m is the number of demands and r is the number of tree- networks. The MIS can 
computed using either the randomized algorithm of Luby [14] or using the deterministic procedure 
of network decompositions [T7]. In the former case, the number of (communication) rounds needed 
in O(logiV), whereas in the latter case, it is 0(2 v/logAr ). Each processor P contains at most one 
demand instance d belonging to the MIS. The processor P raises the dual variables corresponding 
to d as given in the pseudocode. The new dual variables are transmitted to the processors sharing 
a common resource with P. Upon receiving the new dual variables, each processor updates the 
dual variables of its demand instances suitably. Each processor P raises at most a constant num- 
ber of dual variables in each iteration (because the critical set size A = 6 is a constant) and the 
amount of increase is at most p max . Thefore, the message size is bounded by M max . The stack is 
implemented in a distributed manner. Each processor P maintains its own stack. Whenever the 
processor P raises a demand instance d, it pushes d onto its stack along with a tuple t (consisting 
of the corresponding epoch, stage and iteration numbers). 

The second phase proceeds in the reverse order. Each round of the second phase is associated 
with a tuple t (consisting of epoch, stage and iteration numbers). A processor P will compare tuple 
t' on the top of the stack with the tuple t; if they match, then it will pop the demand instance d 
on the top of the stack. It will output d, if feasibility is maintained. In this case, the processor P 
will inform its neighboring processors that d has been included in the output solution. 

Finally, we note that it is not difficult to bypass the assumption that all the processors know 
the values of p max and p m in- 

6 Arbitrary Height Case for Tree-Networks 

Panconesi and Sozio [15] presented a distributed (20 + e)-approximation algorithm for the unit 
height case of the line- networks. Then, they extended this algorithm for the arbitrary height case 
and obtained a (55 + e)-approximation algorithm [16]. In this section, we extend our (7 + e)- 
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approximation algorithm for the unit height case of the tree-networks to the arbitrary height case 
and derive a (20+e)-approximation algorithm. We note that the extra ideas needed for the extension 
roughly follow the theme of Panconesi and Sozio |16j . The main difference is that their algorithm 
follows the two-phase framework with the slackness parameter being A = 1/(5 + e), whereas we aim 
for A = (1 — e). Below we highlight the necessary changes. 

We first develop some notation for the arbitrary height case. The problem setup is as before, 
except that each demand a € A also has a bandwidth requirement (or height) h(a). All the edges 
in all the tree-networks are assumed to provide a uniform bandwidth of 1 unit. For each demand 
a € A owned by a processor P and each tree-network T accessible to P, create a copy of a with the 
same end-points, height and profit; these copies are called demand instances of a. Let T> denote 
the set of all demand instances (over all demands). A demand instance d £ T> belonging to a 
tree-network T has a height h{d) and a profit p(d), and it can be viewed as a path between its 
end-points in T. A feasible solution selects a subset of demand instances S C T> such that: (i) 
for any demand a € A, at most one demand instance of a is selected; (ii) for any edge e in some 
tree-network T, among the demand instances in S, the sum of heights of the demand instances 
passing through e is at most 1 unit. The profit of the solution is the sum of profits of the demand 
instances contained in it. The goal is to choose the solution having the maximum profit. 

We classify the input demand instances d € V into two categories based on their height: (i) 
narrow instances: d is said to be narrow, if h(d) < 1/2; (ii) wide instances: d is said to be wide, 
if h(d) > 1/2. Notice that two wide demand instances which are overlapping cannot be picked 
by a feasible solution. Hence, if our input consists only of wide demand instances, we can reuse 
the algorithm for the unit height case and get a (7 + e) approximation factor. We next describe 
a (13 + e)-approximation algorithm for the special case where in the input consists of only narrow 
instances. The final algorithm will be derived by combining the above two algorithms. 

6.1 Narrow Instances 

Here, we assume that the input consists of only narrow instances and develop a (13+e)-approximation 
algorithm. 

LP and Dual 

The LP and the dual have to be modified suitably to reflect the notion of heights. Recall that for 
each demand instance d € T>, we have a variable x(d). 

5>(d)-p(d) 
1 for edges e € £ 
1 for all demands a £ A 

for all demand instances d 6P 

The first set of constraints capture the fact that the cumulative height of the demand instances 
active on an edge e cannot exceed one unit. Similarly, the second set of constraints capture the 
fact that a feasible solution can select at most one demand instance belonging to any demand. 



max 
x{d) ■ h(d) < 

d£V : d~e 

£ <d) < 

d£lnst(a) 

x(d) > 
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The dual of the LP is as follows. For each demand a £ A and each edge e E £, the dual includes 
a variable a(a) and /3(e), respectively. Similarly, for each demand instance d S T>, the dual includes 
a constraint; we call this the dual constraint of d. Recall that ad denotes the demand to which a 
demand instance d belongs. 

5>(a) + £ /3(e) 

aeA ee£ 

> p(d) for demand instances d € V 

The dual also consists of the non- negativity constraints: for all a G A and e € £, a(a) > and 
/3(e) > 0. 



min 

a{a d ) + h(d) Y, ^ 

e : dr^e 



Two-phase Framework 

We modify the two-phase framework as follows. As before, the algorithm proceeds iteratively and 
a typical iteration is performed as below. We choose suitable independent set I and raise each 
demand instance d G /. The slackness computation is modified to reflect the notion of heights in 
the constraints. Define the slackness to be: 

8 = p(d) - la(a d ) + h(d) ■ Y P( e )) ■ 

\ e : d~e / 

We next select a suitable subset n(d) consisting of critical edges on which d is active. The strategy 
for raising the dual variables is modified slightly. We raise a(a^) by 6(d) and for each e € ir(d), 
raise (3(e) by 2\ir(d)\6(d). Towards that goal, define 6(d) = s/(l + 2h(d)\-rr(d)\ 2 ). We see that the 
dual constraint is satisfied tightly in the process. The second phase of the algorithm remains the 
same. 

The parameters critical set size A and slackness A are defined as before. Similarly, the concept 
of interference property remains the same. Lemma 13.11 can be extended as follows, using similar 
arguments. 



Approximation Guarantee 

Lemma 6.1 Suppose the input consists of only narrow instances. Consider any algorithm satis- 
fying the interference property and governed by parameters A and A. Then the feasible solution S 
produced by the algorithm satisfies p(S) > ^ 1+ 2A a ) '-P(^P t )- 

Proof: As before, let a(-) and /?(•) be the dual variable assignment produced at the end of the 
first phase. Convert the assignment (a, /3) into a dual feasible solution by scaling the values by an 
amount 1/A: for each demand instance d, set a(d) = a(d)/\ and for each edge e, set /3(e) = /3(e)/A. 
Notice that the (a, /3) forms a feasible dual solution. 

Let val(a, /3) and val(a?, f3) be the objective value of the dual assignment (a, /3) and the dual 
feasible solution (S,/3), respectively. By the weak duality theorem, val(S,/3) > p(0pt). The scaling 
process implies that val(S, /3) = val(a, /3)/A. We now establish a relationship between val(a, /3) and 
p(S). 
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Let R denote the set of all demand instances that are raised in the first phase. The value 
val(a, f3) can be computed as follows. For any d G R, the variable a(a,d) is raised by an amount 
6(d) and for each e G 7r(d), the variable /3(e) is raised by an amount 2\7r(d)\5(d). We have that 
\7r(d)\ < A. So, whenever a demand instance d G R is raised, the objective value raises at most 
(2A 2 + l)<5(d). Therefore, 

val(a,/5) < (2A 2 + l)^,5(d). (4) 

We next compute the profit of the solution p(S). For a pair of demand instances d\, di G R, we 
say that d\ is a predecessor of c?2, if the pair is conflicting and d\ is raised before d2\ in this case c?2 
is called the successor of di. For a demand instance <i G i?, let pred(<i) and succ(d) denote the set 
of predecessors and successors of d, respectively; we exclude d from both the sets. 

Define K = R — S; we say that demand instances in K are killed by the procedure. We say that 
a demand d' is killed by a demand d if the following three conditions are true: (i) d G S; (ii) both 
d and d' belong to the same demand; (iii) d' G pred(cf). For a demand d G S, let K±(d) denote the 
set of demand instances killed by d; notice that K\(d) C pred((i). Let K\ be the union of K\(d) 
over all demand instances d G S. Let K2 be the demand instances in K that could not be added 
to the solution because of bandwidth constraints (i.e., K~2 consists of demand instances d such that 
for some edge e G path(d), h(d) + ^2 d / eS : d'~e tyd 1 ) > 1). 

Consider any demand instance d G S and we shall computer a lowerbound on Consider 
the iteration in which d was raised; after the raise, the LHS of the constraint of d becomes equal to 
the RHS (i.e., p(d)). The demand instance d contributes at least 6(d) to the LHS. In the previous 
iterations, any demand instance d! G K±(d) would have contributed 5(d') to the LHS. Similarly, for 
any e G path(d) and any demand instance d' G pred(d) PI K2 such that e G Tt(d'), d' would have 
contributed 2h(d)\ir(d')\5(d') to the LHS. Thus, we see that: 

P(d) > 6(d) + [ Yl S ( d ')) + l E E 2h(d)W)\5(d') 

\d'eK 1 (d) J \eGpath(d) d' £prcd(d)nK 2 : eevr(d') 

Summing up p(d) over all the demand instances d G S, we get that 

P(5)>^5(d)+ J] ^)] + (E E E 2h(d)W)\5(d!) 

deS \d<=S d'eK^d) J \deS eepath(d) d'Gprcd^n^ : e£n(d>) 

Rewriting the second and the third terms of the RHS: 

5>(<o > E s &) + f E 5 ( d 'n + f E W)\6(d>) Y E ^)) • ( 5 ) 

deS d'eS \d'&K l J \d>£K 2 e<=n(d') deSnsucc(d') : d~e J 

Now let us analyze the third term in the RHS. Consider any d' G K~2- The demand instance d! could 
not be added to S because of the bandwidth constraint being violated at some edge e' G path(<f). 
Meaning, 

h(d') + Y h ( d ) > L 

deSnsucc(d') : d~e' 
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Since all demand instances are assumed to be narrow, we have that h(d!) < 1/2. It follows that 

E h w > V2. 

de5nsucc(d') : d~e' 

Let X = {d € S n succ(d') : d ~ e'}. By the interference property, all the demand instances in X 
are active at one of the edges in ir(d'). It follows that there exists an edge e S Tt{d') such that 

1 



> 



2|7r(d')l' 
dex 

where X = {d € X : d~e}. Consider the third term in the RHS of the formula ([5]). The 
second summation is over all the edges e £ ir(d'). We replace this summation by the single quantity 
corresponding to e. We get that 

5>(e0 > E *(<0 + f E W J + f E 2Wd')\5(d') e h ^ 



Therefore, 



* f E ^')) + ( E ) + ( E wi^) >< ^ 



E 



> 

d'ER 

The second statement follows from the fact that R is the union of S, K\ and K2. Comparing (jj]), 
we see that val(a,/3) < (2A 2 + l)p(S'). The lemma is proved by the observations made at the 
beginning of the proof. □ 

Distributed Algorithm 

Fix any e > 0. Our goal is to design a two-phase procedure with critical set size A = 6 and 
slackness parameter A = (1 — e). The distributed algorithm is similar to that of the unit height 
case (Section [5]). We use the same layered decompositions given by Lemma 14.31 (with A = 6 and 
length I = logn). The only difference is that we set the parameter £ = — - % — , for some suitable 
constant c. Arguments similar that of Lemma 15.11 can be used to show that in any epoch k and 
any stage j, the algorithm takes at most 0(log(p max /p m ; n )) steps (or iterations). The number of 
epochs in O(logra), as before. The number of stages within each epoch is log^(e). The above value 
of £ ensures that the above quantity is at most 0{l/h m \ n ) log(l/e). Therefore, the total number of 
steps is 0(Time(MIS) • (l//i m in) ■ (logn) • log(l/e) • log(p max /p m i n )). The above algorithm satisfies 
the interference property with critical set size A = 6 and slackness parameter A = (1 — e). By 
Lemma [6-H the approximation ratio is (2A 2 + 1)/A = 73/(1 — e). We have established the following 
lemma. 

Lemma 6.2 Fix any /i m i n < 1/2 and e > 0. Consider the special case of the scheduling problem 
on tree-networks with heights wherein all the demands a are narrow and satisfy h(a) > h m { n . There 
exists a distributed algorithm for the above problem with approximation ratio (73 + e). The number 
of (communication) rounds is at most O ( Tir " c ( MIS ) Wn log 7 log ^saa 
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Overall Algorithm 

Fix any e' > 0. We present an algorithm within an approximation ratio of (80 + e'), for the arbitrary 
height case of tree-networks. 

We classify the demand instances into wide and narrow instances. Let 0pt x and 0pt 2 denote 
the optimal solutions considering only the wide and narrow instances, respectively. Notice that 
p(0pt) < p(0pt 1 ) +p(0pt 2 ). For the wide instances, we run the algorithm for the unit height case 
(Theorem l5.3p and obtain a solution Si such that p(0pt 1 ) < (7+e')p(Si). Next we run the algorithm 
for the narrow instances (Lemma I6.2[) and obtain a solution S2 such that p(0pt 2 ) < (73 + e / )p(S*2). 
Output a combined solution S as follows. For each tree- network T G T, consider the set of demand 
instances scheduled on T by the solution Si and the set of demand instances scheduled on T by 
the solution S2] among the two sets, choose the one with the higher profit. It is easy to see 
that the bandwidth constraints are satisfied by S. Furthermore, for any demand a, either all the 
demand instances of a are narrow or all of them are wide. Therefore, S will pick at most one 
demand instance from any demand. This shows that S is indeed a feasible solution. We have 
that p(S) > max{p(Si),p(S 2 )}. It follows that p(0pt) < (80 + 2e')p(S). We have established the 
following theorem. 

Theorem 6.3 Fix any /i m i n < 1/2 and e > 0. Consider the scheduling problem on tree-networks 
with arbitrary heights wherein all the demands a satisfy h(a) > h m \ n . There exists a distributed 
algorithm for the above problem with approximation ratio (80 + e). The number of (communication) 



7 Line-Networks 

Recall that Panconesi and Sozio [16j presented distributed algorithms for the case of line-networks 
with windows. For the unit height case the approximation ratio was (20 + e) and for the arbitrary 
height case, the ratio was (55 + e). In this section, we obtain a improved approximation ratios 
(4 + e) and (23 + e), respectively. We next explain the new algorithms within our framework. 

We first develop some notation. Let V be the set of m processors, T be the set of r resources 
and A be the set of n demands. We divide timeline into n discrete timeslots, 1,2, ... ,n. Each 
processor P owns a demand a £ A. Each demand a 6 A is specified by a window [rt(a),dl(a) and 
a processing time p(a), where rt(a) and dl(a) are the release time and deadline of a. A profit p{a) 
and a height h(a) are associated with each demand a. Consider a processor P and the demand 
a owned by P. For each resource T accessible by P and each interval of length p(a) contained 
within [rt(a), dl(a)], create a demand instance d; its profit and height are the same as that of a; the 
number of demand instances is at most n|Acc(P)|, where Acc(P) is the set of resource accessible 
to P. Let T> denote the set of all demand instances. Each demand instance d € T> is described by 
a starting time s(d), and ending time e(d), a profit p(d), a height h(d) and the resource to which 
it belongs. Recall that the time-line can be viewed as a tree-network with n + 1 vertices. In other 
words, the case of line-networks can be reduced to the case of tree-networks. Therefore, Theorem 
15.31 and 16.31 apply to the case of line-networks. We next show how to improve these results in the 
case of line networks. The improvements are obtained by designing layered decompositions with 
better parameters. 
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Improved Layered Decomposition 

In the case of tree-networks, we derived decompositions with critical set size A = 6 and length 
£ = O(logn). In the case special of line-networks, we show how to construct decompositions with 
parameters A = 3 and t = 0(log(L max /L m i n )) (L max and L m ; n are the maximum and minimum 
length of the demand instances). We note that this decomposition is implicit in [16]. 

Partition the demand instances in to I = |~log(L max /L m i n )] categories based on their length, 
where the length of a demand instance is len(d) = e(d) — s(d) + 1. For 1 < i < £, define Gi = 
{d : 2 i - 1 L min < len(d) < 2 { L m ; n . Define an ordering o~ — G%, G2, ■ ■ ■ , Gg. For each demand 
instanced, let mid(d) = [(s(d) + e{d))/2\ be the mid-point of d. Define i:(d) = {s(d),mid(d),e(d)}, 
for all d £ V. It is not difficult to argue that the pair (a, tt) forms layered decomposition. 

Unit Height Case 

Fix e > 0. We modify the distributed algorithm given in Section [5] to use the above layered 
decomposition. We suitably change the value of £ to 8/9 (instead of 14/15). This algorithm would 
satisfy the interference property with A = 3 and A = (1— e). So, the approximation ratio is 4/(1 — e). 
The number of (communication) rounds is 0(Time(MIS) 4og(l/e) dog(L max /L m ; n ) 4og(p max /p m i n ). 
We have established the following result. 

Theorem 7.1 There exists a distributed algorithm for the scheduling problem for the unit height 
case of line-networks with windows with approximation ratio (4 + e). The number of (communica- 
tion) rounds is at most O ^Time(MIS) log \ log log , 

Arbitrary Height Case 

Here, we discuss the arbitrary height case. As in Section [6l we partition the set of demand instances 
into narrow and wide categories. For the case of wide instances, Theorem 17.11 applies and yields an 
algorithm with an approximation ratio of (4 + e). For the case of narrow instances, the algorithm 
is similar to that of Lemma 16.21 the only change is that we set the parameter f = -rrr — , for a 
suitable constant d . This way we get an algorithm with an approximation ratio of 19 + e (because, 
in the current setup A = 3). We obtain an overall algorithm by combining the solutions output 
by the above two algorithm; the idea is same as that of Theorem 16.31 We have established the 
following result. 

Theorem 7.2 Fix any h m i n < 1/2 and e > 0. Consider the scheduling problem on tree-networks 
with arbitrary heights wherein all the demands a satisfy h(a) > h m { n . There exists a distributed 
algorithm for the above problem with approximation ratio (23 + e). The number of (communication) 
rounds is at most O ( Tl ™ e ( MIS ) i Q g n \ g I \ Q g Essi ) _ 
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Begin 

/ / Initialize 

For all a £ A, set a(a) = 0; for all e G £ , set /3(e) = 0. 
Initialize an empty stack. 

// First phase 
For i = 1 to r 
While 

Let U = {d G T>(Ti) : dual constraint of ci is unsatisfied}. 
If [/ = 0, exit the loop. 

Let d be the element of U appearing earliest in <t(T,). 
Compute slackness: s = p(d) — a(aa) — J2 e -d~e <^( e )- 
Compute: 6(d) = s/(\ir(d)\ + 1). 

Raise the variables: ot(ad) <— a(o.d) + 6(d); for all e G 7r(d), /3(e) •<— /3(e) + 5(d). 
Push d into the stack. 

/ / Second Phase 
5 = 0. 

While(stack not empty) 

Pop the top element d of the stack 
If S U {d} is an independent set, then add d to S. 
Output S. 
End 



Figure 8: Sequential Algorithm: Pseudocode 

A A Sequential Algorithm for Tree- networks 

Here, we present a sequential algorithm satisfying the interference property with critical set size 
A = 2 and slackness parameter A = 1. 

Let T = {Ti, T2, . . . , T r } be the given set of tree- networks. For 1 < i < r, construct a rooted tree 
Hi by arbitrarily selecting a node gi E V and making gi as the root of Tj. Let % = {-Hi, ^2, • • ■ , H r } 
be the set of rooted trees constructed by the above process. 

Consider an input tree-network T € T ■ Let H G % be the rooted tree corresponding to T with 
g as the root. For a node x, define its depth (or height) to be the number of the nodes along the 
path from g to x; the root g itself is defined to have to depth 1. For a demand instance d G T>(T), 
let fj,(d) denote the node in H having the least depth among all the nodes appearing in path(d). 
We say that d is captured at the node 11(d); observe that n(d) is uniquely determined. See Figure 
[6l A rooted-tree H has been constructed by picking the node 1 as the root. The demand instance 
d = (4, 13) will be captured at the node fi(d) = 2. With respect to H, a node y is said to be an 
ancestor of x, if y appears along the path from g to x; in this case, x is said to be a descendent of 
y. By convention, we do not consider x to be an ancestor or descendent of itself. 

Consider a demand instance d G V(T). We have that path(d) passes through the node fi(d). 
Let n(d) denote the set of edges of path(cZ) that are adjacent to fi(d). If (j,(d) is one of the end-points 
of d then n(d) will have only one edge; otherwise, it will have two edges. See Figure [U The set 
Tr(d) for the demand instance d = (4, 13) is given by ir(d) = {(2,4), (2,5)}. We can now make the 
following important observation. 
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Observation A.l Consider any two overlapping demand instances d\,d% € T>{T). If d\ is captured 
at a node z and d^ is captured at an ancestor of z, then path(c?2) will include one of the edges from 
7r(di). Furthermore, if d\ and di are captured at the same node z, then also path^) will include 
one of the edges from ir(di). 

Based on the above observation, our algorithm works as follows. In the two-phase framework, 
in each iteration, we need to determine an independent set I. We shall perform the above task by 
selecting a singleton demand instance d (forming a trivial independent set); we can afford to do 
this, since we are designing a sequential algorithm. Towards determining d, we define an ordering 
of the demand instances for each tree-network. Consider a tree-network T € T and let H be the 
rooted tree corresponding to T with g as the root. Order the demand instances d € T>(T) in the 
descending order of the depth of fJ,(d); ties are broken arbitrarily. Thus, demand instances captured 
at the bottom-most leaves of H will be placed first and those captured at the root will be placed 
last. Let a(T) denote the ordering obtained by the above process. 

The first phase works in r rounds, where the ith round will process the tree-network Tj. The 
round i works in multiple iterations, where each iteration is performed as described below. Let U 
be the demand instances belonging to T, whose constraint is unsatisfied (recall that A = 1). Among 
demand instances in U, pick the demand instance d appearing earliest in the ordering c(Tj) and 
raise the demand d: namely, raise the dual variable a(ad) and the dual variables of the edges found 
in ir(d). The ith round is completed when the dual constraints of all the demand instances of T>(Ti) 
are satisfied. The pseudocode of the algorithm is presented in Figure El 

Observation IA.l1 shows that the algorithm satisfies the interference property. Furthermore, for 
any demand instance d, |vr((i)| < 2. Thus, we have a two-phase procedure with parameters A = 2 
and A = 1. Lemma 13.11 shows that the algorithm is a 3-approximation algorithm. In the special 
case where there is only one tree-network, we do not need to raise the dual variables a(-) and the 
approximation ratio can be improved to 2. The resulting algorithm would essentially be the same 
as that of Lewin-Eytan et al. [13) . The round complexity of the algorithm can be as high as n, 
since only a single demand instance is raised in each iteration. 
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